Shallow Semantic Annotation of Bulgarian
نویسندگان
چکیده
The paper discusses shallow semantic annotation of Bulgarian treebank. Our goal is to construct the next layer of linguistic interpretation over the morphological and syntactic layers that have already been encoded in the treebank. The annotation is called shallow because it encodes only the senses for the non-functional words and the relations between the semantic indices connected to them. We do not encode quantifiers and scope information. An ontology is employed as a stock of the concepts and relations that form the word senses. Our lexicon is based on the Generative Lexicon (GL) model (Pustejovsky 1995) as it was implemented in the SIMPLE project (Lenci et. al. 2000). GL defines the way in which the words are connected to the concepts and the relations in the ontology. Also it provides mechanisms for literal sense changes like type-coercion, metonymy, and similar. Some of these phenomena are presented in the
منابع مشابه
New Applications of “Ontology-to-Text Relation” Strategy for Bulgarian Language
The paper presents new applications of the Ontology-to-Text Relation Strategy to Bulgarian Iconographic Domain. First the strategy itself is discussed within the triple ontology-terminological lexicon-annotation grammars, then – the related works. Also, the specificics of the semantic annotation and evaluation over iconographic data are presented. A family of domain ontologies over the iconogra...
متن کاملBulgarian Language Resources for Ontology-Based Semantic Search
This paper presents the language resources, which would facilitate the ontology-based semantic search. Some of these resources are language independent, such as the domain ontology. Some depend on the specific language: terminological lexicons, annotation grammars, sense disambiguation rules, gold standard corpus. Here we focus on the Bulgarian resources constructed in two domains for supportin...
متن کاملOntology-Based Lexicon of Bulgarian
In contrast to morphological and syntactic processing semantic annota tion based on domain ontology is still underdeveloped for Bulgarian. On the other hand, the prerequisites for an ontological annotation are already available. These are as follows: a morphosyntactic tagger for Bulgarian with more than 95% accuracy; a dependency parser with more than 84% accura cy; a general chunker and a na...
متن کاملSemantic-Based Image Retrial in the VQ Compressed Domain using Image Annotation Statistical Models
متن کامل
Linguistic Issues in Language Technology – LiLT
The paper describes the construction of a Bulgarian-English treebank aligned on the word and semantic level. We consider the manual word level alignment easier and more reliable than the manual alignment on syntactic and semantic level. Thus, after manual word level alignment we apply an automatic procedure for the construction of semantic level alignments. Our work presents the main steps of t...
متن کامل